SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: CRAIG, NANCY L 

(ii) TITLE OF INVENTION: GAIN OF FUNCTION MUTATIONS IN 
ATP-DEPENDENT TRANSPOSITION PROTEINS

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Anne Brown (Alston & Bird, LLP)

(B) STREET: 3605 Glenwood Ave.

(C) CITY: Raleigh

(D) STATE: NC

(E) COUNTRY: USA

(F) ZIP: 27608

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Brown, Anne

(B) REGISTRATION NUMBER: 36,463

(C) REFERENCE/DOCKET NUMBER: 5789-3

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 919 420 2205

(B) TELEFAX: 919 881 3175



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1670 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1668



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



ATG AGT GCT ACC CGG ATT CAA GCA GTT TAT CGT GAT ACG GGG GTA GAG 48
Met Ser Ala Thr Arg Ile Gln Ala Val Tyr Arg Asp Thr Gly Val Glu
1 5 10 15

GCT TAT CGT GAT AAT CCT TTT ATC GAG GCC TTA CCA CCA TTA CAA GAG 96
Ala Tyr Arg Asp Asn Pro Phe Ile Glu Ala Leu Pro Pro Leu Gln Glu
20 25 30

TCA GTG AAT AGT GCT GCA TCA CTG AAA TCC TCT TTA CAG CTT ACT TCC 144
Ser Val Asn Ser Ala Ala Ser Leu Lys Ser Ser Leu Gln Leu Thr Ser
35 40 45

TCT GAC TTG CAA AAG TCC CGT GTT ATC AGA GCT CAT ACC ATT TGT CGT 192
Ser Asp Leu Gln Lys Ser Arg Val Ile Arg Ala His Thr Ile Cys Arg
50 55 60

ATT CCA GAT GAC TAT TTT CAG CCA TTA GGT ACG CAT TTG CTA CTA AGT 240
Ile Pro Asp Asp Tyr Phe Gln Pro Leu Gly Thr His Leu Leu Leu Ser
65 70 75 80

GAG CGT ATT TCG GTC ATG ATT CGA GGT GGC TAC GTA GGC AGA AAT CCT 288
Glu Arg Ile Ser Val Met Ile Arg Gly Gly Tyr Val Gly Arg Asn Pro
85 90 95

AAA ACA GGA GAT TTA CAA AAG CAT TTA CAA AAT GGT TAT GAG CGT GTT 336
Lys Thr Gly Asp Leu Gln Lys His Leu Gln Asn Gly Tyr Glu Arg Val
100 105 110

CAA ACG GGA GAG TTG GAG ACA TTT CGC TTT GAG GAG GCA CGA TCT ACG 384
Gln Thr Gly Glu Leu Glu Thr Phe Arg Phe Glu Glu Ala Arg Ser Thr
115 120 125

GCA CAA AGC TTA TTG TTA ATT GGT TGT TCT GGT AGT GGG AAG ACG ACC 432
Ala Gln Ser Leu Leu Leu Ile Gly Cys Ser Gly Ser Gly Lys Thr Thr
130 135 140

TCT CTT CAT CGT ATT CTA GCC ACG TAT CCT CAG GTG ATT TAC CAT CGT 480
Ser Leu His Arg Ile Leu Ala Thr Tyr Pro Gln Val Ile Tyr His Arg
145 150 155 160

GAA CTC AAT GTA GAG CAG GTG GTG TAT TTG AAA ATA GAC TGC TCG CAT 528
Glu Leu Asn Val Glu Gln Val Val Tyr Leu Lys Ile Asp Cys Ser His
165 170 175

AAT GGT TCG CTA AAA GAA ATC TGC TTG AAT TTT TTC AGA GCG TTG GAT 576
Asn Gly Ser Leu Lys Glu Ile Cys Leu Asn Phe Phe Arg Ala Leu Asp
180 185 190

CGA GCC TTG GGC TCG AAC TAT GAG CGT CGT TAT GGC TTA AAA CGT CAT 624
Arg Ala Leu Gly Ser Asn Tyr Glu Arg Arg Tyr Gly Leu Lys Arg His
195 200 205

GGT ATA GAA ACC ATG TTG GCT TTG ATG TCG CAA ATA GCC AAT GCA CAT 672
Gly Ile Glu Thr Met Leu Ala Leu Met Ser Gln Ile Ala Asn Ala His
210 215 220

GCT TTA GGG TTG TTG GTT ATT GAT GAA ATT CAG CAT TTA AGC CGC TCT 720
Ala Leu Gly Leu Leu Val Ile Asp Glu Ile Gln His Leu Ser Arg Ser
225 230 235 240

CGT TCG GGT GGA TCT CAA GAG ATG CTG AAC TTT TTT GTG ACG ATG GTG 768
Arg Ser Gly Gly Ser Gln Glu Met Leu Asn Phe Phe Val Thr Met Val
245 250 255

AAT ATT ATT GGC GTA CCA GTG ATG TTG ATT GGT ACC CCT AAA GCA CGA 816
Asn Ile Ile Gly Val Pro Val Met Leu Ile Gly Thr Pro Lys Ala Arg
260 265 270

GAG ATT TTT GAG GCT GAT TTG CGG TCT GCA CGT AGA GGG GCA GGG TTT 864
Glu Ile Phe Glu Ala Asp Leu Arg Ser Ala Arg Arg Gly Ala Gly Phe
275 280 285

GGA GCT ATA TTC TGG GAT CCT ATA CAA CAA ACG CAA CGT GGA AAG CCC 912
Gly Ala Ile Phe Trp Asp Pro Ile Gln Gln Thr Gln Arg Gly Lys Pro
290 295 300

AAT CAA GAG TGG ATC GCT TTT ACG GAT AAT CTT TGG CAA TTA CAG CTT 960
Asn Gln Glu Trp Ile Ala Phe Thr Asp Asn Leu Trp Gln Leu Gln Leu
305 310 315 320

TTA CAA CGC AAA GAT GCG CTG TTA TCG GAT GAG GTC CGT GAT GTG TGG 1008
Leu Gln Arg Lys Asp Ala Leu Leu Ser Asp Glu Val Arg Asp Val Trp
325 330 335

TAT GAG CTA AGC CAA GGA GTG ATG GAC ATT GTA GTA AAA CTT TTT GTA 1056
Tyr Glu Leu Ser Gln Gly Val Met Asp Ile Val Val Lys Leu Phe Val
340 345 350

CTC GCT CAG CTC CGT GCG CTA GCT TTA GGC AAT GAG CGT ATT ACC GCT 1104
Leu Ala Gln Leu Arg Ala Leu Ala Leu Gly Asn Glu Arg Ile Thr Ala
355 360 365

GGT TTA TTG CGG CAA GTG TAT CAA GAT GAG TTA AAG CCT GTG CAC CCC 1152
Gly Leu Leu Arg Gln Val Tyr Gln Asp Glu Leu Lys Pro Val His Pro
370 375 380

ATG CTA GAG GCA TTA CGC TCG GGT ATC CCA GAA CGC ATT GCT CGT TAT 1200
Met Leu Glu Ala Leu Arg Ser Gly Ile Pro Glu Arg Ile Ala Arg Tyr
385 390 395 400

TCT GAT CTA GTC GTT CCC GAG ATT GAT AAA CGG TTA ATC CAA CTT CAG 1248
Ser Asp Leu Val Val Pro Glu Ile Asp Lys Arg Leu Ile Gln Leu Gln
405 410 415

CTA GAT ATC GCA GCG ATA CAA GAA CAA ACA CCA GAA GAA AAA GCC CTT 1296
Leu Asp Ile Ala Ala Ile Gln Glu Gln Thr Pro Glu Glu Lys Ala Leu
420 425 430

CAA GAG TTA GAT ACC GAA GAT CAG CGT CAT TTA TAT CTG ATG CTG AAA 1344
Gln Glu Leu Asp Thr Glu Asp Gln Arg His Leu Tyr Leu Met Leu Lys
435 440 445

GAG GAT TAC GAT TCA AGC CTG TTA ATT CCC ACT ATT AAA AAA GCG TTT 1392

Glu Asp Tyr Asp Ser Ser Leu Leu Ile Pro Thr Ile Lys Lys Ala Phe
450 455 460

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 556 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ser Ala Thr Arg Ile Gln Ala Val Tyr Arg Asp Thr Gly Val Glu
1 5 10 15

Ala Tyr Arg Asp Asn Pro Phe Ile Glu Ala Leu Pro Pro Leu Gln Glu
20 25 30

Ser Val Asn Ser Ala Ala Ser Leu Lys Ser Ser Leu Gln Leu Thr Ser
35 40 45

Ser Asp Leu Gln Lys Ser Arg Val Ile Arg Ala His Thr Ile Cys Arg
50 55 60

Ile Pro Asp Asp Tyr Phe Gln Pro Leu Gly Thr His Leu Leu Leu Ser
65 70 75 80

Glu Arg Ile Ser Val Met Ile Arg Gly Gly Tyr Val Gly Arg Asn Pro
85 90 95

Lys Thr Gly Asp Leu Gln Lys His Leu Gln Asn Gly Tyr Glu Arg Val
100 105 110

Gln Thr Gly Glu Leu Glu Thr Phe Arg Phe Glu Glu Ala Arg Ser Thr
115 120 125

Ala Gln Ser Leu Leu Leu Ile Gly Cys Ser Gly Ser Gly Lys Thr Thr
130 135 140

Ser Leu His Arg Ile Leu Ala Thr Tyr Pro Gln Val Ile Tyr His Arg
145 150 155 160

Glu Leu Asn Val Glu Gln Val Val Tyr Leu Lys Ile Asp Cys Ser His
165 170 175

Asn Gly Ser Leu Lys Glu Ile Cys Leu Asn Phe Phe Arg Ala Leu Asp
180 185 190

Arg Ala Leu Gly Ser Asn Tyr Glu Arg Arg Tyr Gly Leu Lys Arg His
195 200 205

Gly Ile Glu Thr Met Leu Ala Leu Met Ser Gln Ile Ala Asn Ala His
210 215 220

Ala Leu Gly Leu Leu Val Ile Asp Glu Ile Gln His Leu Ser Arg Ser
225 230 235 240

Arg Ser Gly Gly Ser Gln Glu Met Leu Asn Phe Phe Val Thr Met Val
245 250 255

Asn Ile Ile Gly Val Pro Val Met Leu Ile Gly Thr Pro Lys Ala Arg
260 265 270

Glu Ile Phe Glu Ala Asp Leu Arg Ser Ala Arg Arg Gly Ala Gly Phe
275 280 285

Gly Ala Ile Phe Trp Asp Pro Ile Gln Gln Thr Gln Arg Gly Lys Pro
290 295 300

Asn Gln Glu Trp Ile Ala Phe Thr Asp Asn Leu Trp Gln Leu Gln Leu
305 310 315 320

Leu Gln Arg Lys Asp Ala Leu Leu Ser Asp Glu Val Arg Asp Val Trp
325 330 335

Tyr Glu Leu Ser Gln Gly Val Met Asp Ile Val Val Lys Leu Phe Val
340 345 350

Leu Ala Gln Leu Arg Ala Leu Ala Leu Gly Asn Glu Arg Ile Thr Ala
355 360 365

Gly Leu Leu Arg Gln Val Tyr Gln Asp Glu Leu Lys Pro Val His Pro
370 375 380

Met Leu Glu Ala Leu Arg Ser Gly Ile Pro Glu Arg Ile Ala Arg Tyr
385 390 395 400

Ser Asp Leu Val Val Pro Glu Ile Asp Lys Arg Leu Ile Gln Leu Gln
405 410 415

Leu Asp Ile Ala Ala Ile Gln Glu Gln Thr Pro Glu Glu Lys Ala Leu
420 425 430

Gln Glu Leu Asp Thr Glu Asp Gln Arg His Leu Tyr Leu Met Leu Lys
435 440 445

Glu Asp Tyr Asp Ser Ser Leu Leu Ile Pro Thr Ile Lys Lys Ala Phe
450 455 460

Ser Gln Asn Pro Thr Met Thr Arg Gln Lys Leu Leu Pro Leu Val Leu
465 470 475 480

Gln Trp Leu Met Glu Gly Glu Thr Val Val Ser Glu Leu Glu Lys Pro
485 490 495

Ser Lys Ser Lys Lys Val Ser Ala Ile Lys Val Val Lys Pro Ser Asp
500 505 510

Trp Asp Ser Leu Pro Asp Thr Asp Leu Arg Tyr Ile Tyr Ser Gln Arg
515 520 525

Gln Pro Glu Lys Thr Met His Glu Arg Leu Lys Gly Lys Gly Val Ile
530 535 540

Val Asp Met Ala Ser Leu Phe Lys Gln Ala Gly *
545 550 555


(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "pEM delta R.adj to 1"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

TTTAGAGCAA TTCGGTGTTA GTTTCAGCAA GCAAACATTA ACCATAGCTA ATGATTTATA 60

GCCATATTAA CCATTGGGGT ACCGAGCTCG AATTCCATGG TCTGTTTCCT GTGTGAAATT 120

GTTATCCGCT CACAATTCCA CACATTATAC GAGCCGGATG ATTAATTGTC AACAGCTCAT 180

TTCAGAATAT TTGCCAGAAC CGTTATGATG TCGGCGCAAA AAACATTATC CAGAACGGGA 240

GTGCGCCTTG AGCGACACGA ATTATGCAGT GATTTACGAC CTGCACAGCC ATACCACAGC 300

TTCCGATGGC TGCCTGACGC CAGAAGCATT GGTGCACCGT GCAGTCGATG ATAAGCTGTC 360

AAACCAGATC AATTCGCGCT AACTCACATT AATTGCGTTG CGCTCACTGC CCGCTTTCCA 420

GTCGGGAAAC CTGTCGTGCC AGCTGCATTA ATGAATCGGC CAACGCGCGG GGAGAGGCGG 480

TTTGCGTATT GGGCGCCAGG GTGGTTTTTC TTTTCACCAG TGAGACGGGC AACAGCTGAT 540

TGCCCTTCAC CGCCTGGCCC TGAGAGAGTT GCAGCAAGCG GTCCACGCTG GTTTGCCCCA 600

GCAGGCGAAA ATCCTGTTTG ATGGTGGTTG ACGGCGGGAT ATAACATGAG CTGTCTTCGG 660

TATCGTCGTA TCCCACTACC GAGATATCCG CACCAACGCG CAGCCCGGAC TCGGTAATGG 720

CGCGCATTGC GCCCAGCGCC ATCTGATCGT TGGCAACCAG CATCGCAGTG GGAACGATGC 780

CCTCATTCAG CATTTGCATG GTTTGTTGAA AACCGGACAT GGCACTCCAG TCGCCTTCCC 840

GTTCCGCTAT CGGCTGAATT TGATTGCGAG TGAGATATTT ATGCCAGCCA GCCAGACGCA 900

GACGCGCCGA GACAGAACTT AATGGGCCCG CTAACAGCGC GATTTGCTGG TGACCCAATG 960

CGACCAGATG CTCCACGCCC AGTCGCGTAC CGTCTTCATG GGAGAAAATA ATACTGTTGA 1020 

TGGGTGTCTG GTCAGAGACA TCAAGAAATA ACGCCGGAAC ATTAGTGCAG GCAGCTTCCA 1080 

CAGCAATGGC ATCCTGGTCA TCCAGCGGAT AGTTAATGAT CAGCCCACTG ACGCGTTGCG 1140 

CGAGAAGATT GTGCACCGCC GCTTTACAGG CTTCGACGCC GCTTCGTTCT ACGATCGACA 1200

CCACCACGCT GGCACCCAGT TGATCGGCGC GAGATTTAAT CGCCGCGACA ATTTGCGACG 1260 

GCGCGTGCAG GGCCAGACTG GAGGTGGCAA CGCCAATCAG CAACGACTGT TTGCCCGCCA 1320

GTTGTTGTGC CACGCGGTTG GGAATGTAAT TCAGCTCCGC CATCGCCGCT TCCACTTTTT 1380

CCCGCGTTTT CGCAGAAACG TGGCTGGCCT GGTTCACCAC GCGGGAAACG GTCTGATAAG 1440

AGACACCGGC ATACTCTGCG ACATCGTATA ACGTTACTGG TTTCACATTC ACCACCCTGA 1500

ATTGACTCTC TTCCGGGCGC TATCATGCCA TACCGCGAAA GGTTTTGCAC CATTCGATGG 1560 

TGTCAACGTA AATGCATGCC GCTTCGCCTT CGCGCGCGAA TTGATCTGCT GCCTCGCGCG 1620

TTTCGGTGAT GACGGTGAAA ACCTCTGACA CATGCAGCTC CCGGAGACGG TCACAGCTTG 1680 

TCTGTAAGCG GATGCCGGGA GCAGACAAGC CCGTCAGGGC GCGTCAGCGG GTGTTGGCGG 1740

GTGTCGGGGC GCAGCCATGA CCCAGTGACG TAGCGATAGC GGAGTGTATA CTGGCTTAAC 1800 

TATGCGGCAT CAGAGCAGAT TGTACTGAGA GTGCACCATA TGCGGTGTGA AATACCGCAC 1860 

AGATGCGTAA GGAGAAAATA CCGCATCAGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG 1920

CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG 1980

TTATCCACAG AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG 2040 

GCCAGGAACC GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC 2100 

GAGCATCACA AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA 2160 

TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT 2220
ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA TAGCTCACGC 2280
TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC 2340
CCCGTTCAGC CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA 2400
AGACACGACT TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT 2460
GTAGGCGGTG CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA 2520
GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT 2580
TGATCCGGCA AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT 2640
ACGCGCAGAA AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT 2700
CAGTGGAACG AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC 2760
ACCTAGATCC TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA 2820
ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA 2880
TTTCGTTCAT CCATAGTTGC CTGACTCCCC CTCGTGTAGA TAACTACGAT ACGGGAGGGC 2940
TTACCATCTG GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT 3000
TTATCAGCAA TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA 3060
TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT 3120
AATAGTTTGC GCAACGTTGT TGCCATTGCT GTAGGCATCG TGGTGTCACG CTCGTCGTTT 3180
GGTATGGCTT CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG 3240
TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC 3300
GCAGTGTTAT CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC 3360
GTAAGATGCT TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG 3420
CGGCGACCGA GTTGCTCTTG CCCGGCGTCA ACACGGGATA ATACCGCGCC ACATAGCAGA 3480
ACTTTAAAAG TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA 3540
CCGCTGTTGA GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT 3600
TTTACTTTCA CCAGGGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG 3660
GGAATAAGGG CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA 3720
AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT 3780
AAACAAAAAG AGTTTGTAGA AACGCAAAAA GGCCATCCGT CAGGATGGCC TTCTGCTTAA 3840
TTTGATGCCT GGCAGTTTAT GGCGGGCGTC CTGCCCGCCA CCCTCCGGGC CGTTGCTTCG 3900
CAACGTTCAA ATCCGCTCCC GGCGGATTTG TCCTACTCAG GAGAGCGTTC ACCGACAAAC 3960
AACAGATAAA ACGAAAGGCC CAGTCTTTCG ACTGAGCCTT TCGTTTTATT TGATGCCTGG 4020

CAGTTCCCTA CTCTCGCATG GGGAGACCCC ACACTACCAT CGGCGCTACG GCGTTTCACT 4080

TCTGAGTTCG GCATGGGGTC AGGTGGGACC ACCGCGCTAC TGCCGCCAGG CAAATTCTGT 4140

TTTATCAGAC CGCTTCTGCG TTCTGATTTA ATCTGTATCA GGCTGAAAAT CTTCTCTCAT 4200

CCGCCAAAAC AGCCAAGCTT GCATGCCTGC AGGTCGACTC TAGAGGATCC CCAAGAAAGT 4260

CCGTCGGACA GCTTTAATAA ACCCTGCACT TATCTGTTTA GTGTGGGCGG ACAAAATAGT 4320

TGGGAACTGG GAGGGGTGGA AATGGAGTTT TTAAGGATTA TTTAGGGAAG AGTGACAAAA 4380

TAGATGGGAA CTGGGTGTAG CGTCGTAAGC TAATACGAAA ATTAAAAATG ACAAAATAGT 4440

TTGGAACTAG ATTTCACTTA TCTGGTTGGT CGACCTGCAG GGGGGGGGGG GAAAGCCACG 4500

TTGTGTCTCA AAATCTCTGA TGTTACATTG CACAAGATAA AAATATATCA TCATGAACAA 4560

TAAAACTGTC TGCTTACATA AACAGTAATA CAAGGGGTGT TATGAGCCAT ATTCAACGGG 4620

AAACGTCTTG CTCGAGGCCG CGATTAAATT CCAACATGGA TGCTGATTTA TATGGGTATA 4680

AATGGGCTCG CGATAATGTC GGGCAATCAG GTGCGACAAT CTATCGATTG TATGGGAAGC 4740

CCGATGCGCC AGAGTTGTTT CTGAAACATG GCAAAGGTAG CGTTGCCAAT GATGTTACAG 4800

ATGAGATGGT CAGACTAAAC TGGCTGACGG AATTTATGCC TCTTCCGACC ATCAAGCATT 4860

TTATCCGTAC TCCTGATGAT GCATGGTTAC TCACCACTGC GATCCCCGGG AAAACAGCAT 4920

TCCAGGTATT AGAAGAATAT CCTGATTCAG GTGAAAATAT TGTTGATGCG CTGGCAGTGT 4980

TCCTGCGCCG GTTGCATTCG ATTCCTGTTT GTAATTGTCC TTTTAACAGC GATCGCGTAT 5040

TTCGTCTCGC TCAGGCGCAA TCACGAATGA ATAACGGTTT GGTTGATGCG AGTGATTTTG 5100

ATGACGAGCG TAATGGCTGG CCTGTTGAAC AAGTCTGGAA AGAAATGCAT AAGCTTTTGC 5160

CATTCTCACC GGATTCAGTC GTCACTCATG GTGATTTCTC ACTTGATAAC CTTATTTTTG 5220

ACGAGGGGAA ATTAATAGGT TGTATTGATG TTGGACGAGT CGGAATCGCA GACCGATACC 5280

AGGATCTTGC CATCCTATGG AACTGCCTCG GTGAGTTTTC TCCTTCATTA CAGAAACGGC 5340

TTTTTCAAAA ATATGGTATT GATAATCCTG ATATGAATAA ATTGCAGTTT CATTTGATGC 5400

TCGATGAGTT TTTCTAATCA GAATTGGTTA ATTGGTTGTA ACACTGGCAG AGCATTACGC 5460

TGACTTGACG GGACGGCGGC TTTGTTGAAT AAATCGAACT TTTGCTGAGT TGAAGGATCA 5520

GATCACGCAT CTTCCCGACA ACGCAGACCG TTCCGTGGCA AAGCAAAAGT TCAAAATCAC 5580

CAACTGGTCC ACCTACAACA AAGCTCTCAT CAACCGTGGC TCCCTCACTT TCTGGCTGGA 5640

TGATGGGGCG ATTCAGGCCT GGTATGAGTC AGCAACACCT TCTTCACGAG GCAGACCTCA 5700

GCGCCCCCCC CCCCCTGCAG GTCGACCCCA CGCCCCTCTT TAATACGACG GGCAATTTGC 5760

ACTTCAGAAA ATGAAGAGTT TGCTTTAGCC ATAACAAAAG TCCAGTATGC TTTTTCACAG 5820

CATAACTGGA CTGATTTCAG TTTACAACTA TTCTGTCTAG TTTAAGACTT TATTGTCATA 5880

GTTTAGATCT ATTTTGTTCA GTTTAAGACT TTATTGTCCG CCCACA 5926

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "pEM-delta"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

CAGATCAATT CGCGCTAACT CACATTAATT GCGTTGCGCT CACTGCCCGC TTTCCAGTCG 60
GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG 120
CGTATTGGGC GCCAGGGTGG TTTTTCTTTT CACCAGTGAG ACGGGCAACA GCTGATTGCC 180
CTTCACCGCC TGGCCCTGAG AGAGTTGCAG CAAGCGGTCC ACGCTGGTTT GCCCCAGCAG 240
GCGAAAATCC TGTTTGATGG TGGTTGACGG CGGGATATAA CATGAGCTGT CTTCGGTATC 300
GTCGTATCCC ACTACCGAGA TATCCGCACC AACGCGCAGC CCGGACTCGG TAATGGCGCG 360
CATTGCGCCC AGCGCCATCT GATCGTTGGC AACCAGCATC GCAGTGGGAA CGATGCCCTC 420
ATTCAGCATT TGCATGGTTT GTTGAAAACC GGACATGGCA CTCCAGTCGC CTTCCCGTTC 480
CGCTATCGGC TGAATTTGAT TGCGAGTGAG ATATTTATGC CAGCCAGCCA GACGCAGACG 540
CGCCGAGACA GAACTTAATG GGCCCGCTAA CAGCGCGATT TGCTGGTGAC CCAATGCGAC 600
CAGATGCTCC ACGCCCAGTC GCGTACCGTC TTCATGGGAG AAAATAATAC TGTTGATGGG 660
TGTCTGGTCA GAGACATCAA GAAATAACGC CGGAACATTA GTGCAGGCAG CTTCCACAGC 720
AATGGCATCC TGGTCATCCA GCGGATAGTT AATGATCAGC CCACTGACGC GTTGCGCGAG 780
AAGATTGTGC ACCGCCGCTT TACAGGCTTC GACGCCGCTT CGTTCTACCA TCGACACCAC 840
CACGCTGGCA CCCAGTTGAT CGGCGCGAGA TTTAATCGCC GCGACAATTT GCGACGGCGC 900
GTGCAGGGCC AGACTGGAGG TGGCAACGCC AATCAGCAAC GACTGTTTGC CCGCCAGTTG 960
TTGTGCCACG CGGTTGGGAA TGTAATTCAG CTCCGCCATC GCCGCTTCCA CTTTTTCCCG 1020
CGTTTTCGCA GAAACGTGGC TGGCCTGGTT CACCACGCGG GAAAGGGTCT GATAAGAGAC 1080
ACCGGCATAC TCTGCGACAT CGTATAACGT TACTGGTTTC ACATTCACCA CCCTGAATTG 1140 
ACTCTCTTCC GGGCGCTATC ATGCCATACC GCGAAAGGTT TTGCACCATT CGATGGTGTC 1200 
AACGTAAATG CATGCCGCTT CGCCTTCGCG CGCGAATTGA TCTGCTGCCT CGCGCGTTTC 1260
GGTGATGACG GTGAAAACCT CTGACACATG CAGCTCCCGG AGACGGTCAC AGCTTGTCTG 1320
TAAGCGGATG CCGGGAGCAG ACAAGCCCGT CAGGGCGCGT CAGCGGGTGT TGGCGGGTGT 1380
CGGGGCGCAG CCATGACCCA GTCACGTAGC GATAGCGGAG TGTATACTGG CTTAACTATG 1440
CGGCATCAGA GCAGATTGTA CTGAGAGTGC ACCATATGCG GTGTGAAATA CCGCACAGAT 1500
GCGTAAGGAG AAAATACCGC ATCAGGCGCT CTTCCGCTTC CTCGCTCACT GACTCGCTGC 1560
GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT CAGCTCACTC AAAGGCGGTA ATACGGTTAT 1620
CCACAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC AAAAGGCCAG CAAAAGGCCA 1680 
GGAACCGTAA AAAGGCCGCG TTGCTCGCGT TTTTCCATAG GCTCCGCCCC CCTGACGAGC 1740
ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAAGCC GACAGGACTA TAAAGATACC 1800
AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG 1860
GATACCTGTC CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCATAGC TCACGCTGTA 1920
GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC GAACCCGCCG 1980 
TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCAAC CCGGTAAGAC 2040
ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT TAGCAGAGCG AGGTATGTAG 2100
GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACTACGG CTACACTAGA AGGACAGTAT 2160
TTGGTATCTG CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT 2220 
CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG CAGATTACGC 2280
GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC TACGGGGTCT GACGCTCAGT 2340
GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT ATCAAAAAGG ATCTTCACCT 2400
AGATCCTTTT AAATTAAAAA TGAAGTTTTA AATCAATCTA AAGTATATAT GAGTAAACTT 2460
GGTCTGACAG TTACCAATGC TTAATCAGTG AGGCACCTAT CTCAGCGATC TGTCTATTTC 2520
GTTCATCCAT AGTTGCCTGA CTCGCCGTCG TGTAGATAAC TACGATACGG GAGGGCTTAC 2580
CATCTGGCCC CAGTGCTGCA ATGATACCGC GAGACGCACG CTCACCGGCT CCAGATTTAT 2640
CAGCAATAAA CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG TGGTCCTGCA ACTTTATCCG 2700 
CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT AAGTAGTTCG CCAGTTAATA 2760
GTTTGCGCAA CGTTGTTGCC ATTGCTGTAG GCATCGTGGT GTCACGCTCG TCGTTTGGTA 2820
TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT TACATGATCC CCCATGTTGT 2880 
GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC CGATCGTTGT CAGAAGTAAG TTGGCCGCAG 2940
TGTTATCACT CATGGTTATG GCAGCACTGC ATAATTCTCT TACTGTCATG CCATCCGTAA 3000 
GATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT CTGAGAATAG TGTATGCGGC 3060 
GACCGAGTTG CTCTTGCCCG GCGTCAACAC GGGATAATAC CGCGCCACAT AGCAGAACTT 3120
TAAAAGTGCT CATCATTGGA AAACGTTCTT CGGGGCGAAA ACTCTCAAGG ATCTTACCGC 3180
TGTTGAGATC CAGTTCGATG TAACCCACTC GTGCACCCAA GTGATCTTCA GCATCTTTTA 3240 
CTTTCACCAG CGTTTGTGGG TGAGCAAAAA CAGGAAGGCA AAATGCCGCA AAAAAGGGAA 3300
TAAGGGCGAC ACGGAAATGT TGAATACTCA TACTCTTCCT TTTTCAATAT TATTGAAGCA 3360
TTTATCAGGG TTATTGTCTC ATGAGCGGAT ACATATTTGA ATGTATTTAG AAAAATAAAC 3420 
AAAAAGAGTT TGTAGAAACG CAAAAAGGCC ATCCGTCAGG ATGGCCTTCT GCTTAATTTG 3480
ATGCCTGGCA GTTTATGGCG GGCGTCCTGC CCGCCACCCT CCGGGCCGTT GCTTCGCAAC 3540
GTTCAAATCC GCTCCCGGCG GATTTGTCCT ACTCAGGAGA GCGTTCACCG ACAAACAACA 3600 
GATAAAAGGA AAGGCCCAGT CTTTCGACTG AGCCTTTCGT TTTATTTGAT GCCTGGCAGT 3660 
TCCCTACTCT CGCATGGGGA GACCCCACAC TAGCATCGGC GCTACGGCGT TTCACTTCTG 3720
AGTTCGGCAT GGGGTCAGGT GGGACCACCG CGGTACTGCC GCCAGGCAAA TTCTGTTTTA 3780
TCAGACCGCT TCTGCGTTCT GATTTAATCT GTATCAGGCT GAAAATCTTC TCTCATCCGC 3840
CAAAACAGCC AAGCTTGCAT GCCTGCAGGT CGACTCTAGA GGATCCCCAA GAAAGTCCGT 3900
CGGACAGCTT TAATAAACCC TGCACTTATC TGTTTAGTGT GGGCGGACAA AATAGTTGGG 3960
AACTGGGAGG GGTGGAAATG GAGTTTTTAA GGATTATTTA GGGAAGAGTG ACAAAATAGA 4020 
TGGGAACTGG GTGTAGCGTG GTAAGCTAAT ACGAAAATTA AAAATGACAA AATAGTTTGG 4080 
AACTAGATTT CACTTATCTG GTTGGTCGAC CTGCAGGGGG GGGGGGGAAA GCCACGTTGT 4140 
GTCTCAAAAT CTCTGATGTT ACATTGCACA AGATAAAAAT ATATCATGAT GAACAATAAA 4200 
ACTGTCTGCT TACATAAACA GTAATACAAG GGGTGTTATG AGCCATATTC AACGGGAAAG 4260 
GTCTTGCTCG AGGCCGCGAT TAAATTCCAA CATGGATGCT GATTTATATG GGTATAAATG 4320 
GGCTCGCGAT AATGTCGGGC AATCAGGTGC GACAATCTAT CGATTGTATG GGAAGCGCGA 4380 
TGCGCCAGAG TTGTTTCTGA AACATGGCAA AGGTAGCGTT GCCAATGATG TTACAGATGA 4440 
GATGGTCAGA CTAAACTGGC TGACGGAATT TATGCCTCTT CCGACCATCA AGCATTTTAT 4500
CCGTACTCCT GATGATGCAT GGTTACTCAC CACTGCGATC CCCGGGAAAA CAGCATTCCA 4560

GGTATTAGAA GAATATCCTG ATTCAGGTGA AAATATTGTT GATGCGCTGG CAGTGTTCCT 4620

GCGCCGGTTG CATTCGATTC CTGTTTGTAA TTGTCCTTTT AACAGCGATC GCGTATTTCG 4680

TCTCGCTCAG GCGCAATCAC GAATGAATAA CGGTTTGGTT GATGCGAGTG ATTTTGATGA 4740

CGAGCGTAAT GGCTGGCCTG TTGAACAAGT CTGGAAAGAA ATGCATAAGC TTTTGCCATT 4800

CTCACCGGAT TCAGTCGTCA CTCATGGTGA TTTCTCACTT GATAACCTTA TTTTTGACGA 4860

GGGGAAATTA ATAGGTTGTA TTGATGTTGG ACGAGTCGGA ATCGCAGACC GATACCAGGA 4920

TCTTGCCATC CTATGGAACT GCCTCGGTGA GTTTTCTCCT TCATTACAGA AACGGCTTTT 4980

TCAAAAATAT GGTATTGATA ATCCTGATAT GAATAAATTG CAGTTTCATT TGATGCTCGA 5040

TGAGTTTTTC TAATCAGAAT TGGTTAATTG GTTGTAACAC TGGCAGAGCA TTACGCTGAC 5100

TTGACGGGAC GGCGGCTTTG TTGAATAAAT CGAACTTTTG CTGAGTTGAA GGATCAGATC 5160

ACGCATCTTC CCGACAACGC AGACCGTTCC GTGGCAAAGC AAAAGTTCAA AATCACCAAC 5220

TGGTCCACCT ACAACAAAGC TCTCATCAAC CGTGGCTCCC TCACTTTCTG GCTGGATGAT 5280

GGGGCGATTC AGGCCTGGTA TGAGTCAGCA ACACCTTCTT CACGAGGCAG ACCTCAGCGC 5340

CCCCCCCCCC CTGCAGGTCG ACCCCACGCC CCTCTTTAAT ACGACGGGCA ATTTGCACTT 5400
CAGAAAATGA AGAGTTTGCT TTAGCCATAA CAAAAGTCCA GTATGCTTTT TCACAGCATA 5460
ACTGGACTGA TTTCAGTTTA CAACTATTCT GTCTAGTTTA AGACTTTATT GTCATAGTTT 5520
AGATCTATTT TGTTCAGTTT AAGACTTTAT TGTCCGCCCA CATTTAGAGC AATTCGGTGT 5580
TAGTTTCAGC AAGCAAACAT TAACCATAGC TAATGATTTA TAGCCATATT AACCATTGGG 5640
GTACCGAGCT CGAATTCCAT GGTCTGTTTC CTGTGTGAAA TTGTTATCCG CTCACAATTC 5700
CACACATTAT ACGAGCCGGA TGATTAATTG TCAACAGCTC ATTTCAGAAT ATTTGCCAGA 5760
ACCGTTATGA TGTCGGCGCA AAAAACATTA TCCAGAACGG GAGTGCGCCT TGAGCGACAC 5820
GAATTATGCA GTGATTTACG ACCTGCACAG CCATACCACA GCTTCCGATG GCTGCCTGAC 5880
GCCAGAAGCA TTGGTGCACC GTGCAGTCGA TGATAAGCTG TCAAAC 5926

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "pER183 (target plasmid)"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GAATTCCGGA TGAGCATTCA TCAGGCGGGC AAGAATGTGA ATAAAGGCCG GATAAAACTT 60

GTGCTTATTT TTCTTTACGG TCTTTAAAAA GGCCGTAATA TCCAGCTGAA CGGTCTGGTT 120

ATAGGTACAT TGAGCAACTG ACTGAAATGC CTCAAAATGT TCTTTACGAT GCCATTGGGA 180

TATATCAACG GTGGTATATC CAGTGATTTT TTTCTCCATT TTAGCTTCCT TAGCTCCTGA 240

AAATCTCGAT AACTCAAAAA ATACGCCCGG TAGTGATCTT ATTTCATTAT GGTGAAAGTT 300 

GGAACCTCTT ACGTGCCGAT CAACGTCTCA TTTTCGCCAA AAGTTGGCCC AGGGCTTCCC 360

GGTATCAACA GGGACACCAG GATTTATTTA TTCTGCGAAG TGATCTTCCG TCACAGGTAT 420

TTATTCGGCG CAAAGTGCGT CGGGTGATGC TGCCAACTTA CTGATTTAGT GTATGATGGT 480 

GTTTTTGAGG TGCTCCAGTG GCTTCTGTTT CTATCAGCTG TCCCTCCTGT TCAGCTACTG 540

ACGGGGTGGT GCGTAACGGC AAAAGCACCG CCGGACATCA GCGCTAGCGG AGTGTATACT 600

GGCTTACTAT GTTGGCACTG ATGAGGGTGT CAGTGAAGTG CTTCATGTGG CAGGAGAAAA 660 

AAGGCTGCAC CGGTGCGTCA GCAGAATATG TGATACAGGA TATATTCCGC TTCCTCGCTC 720 

ACTGACTCGC TACGCTCGGT CGTTCGACTG CGGCGAGCGG AAATGGCTTA CGAACGGGGC 780 

GGAGATTTCC TGGAAGATGC CAGGAAGATA CTTAACAGGG AAGTGAGAGG GCCGCGGCAA 840 

AGCCGTTTTT CCATAGGCTC CGCCCCCCTG ACAAGCATCA CGAAATCTGA CGCTCAAATC 900 
AGTGGTGGCG AAACCCGACA GGACTATAAA GATACCAGGC GTTTCCCCTG GCGGCTCCCT 960

CGTGCGCTCT CCTGTTCCTG CCTTTCGGTT TACCGGTGTC ATTCCGCTGT TATGGCCGCG 1020

TTTGTCTCAT TCCACGCCTG ACACTCAGTT CCGGGTAGGC AGTTCGCTCC AAGCTGGACT 1080

GTATGCACGA ACCCCCCGTT CAGTCCGACC GCTGCGCCTT ATCCGGTAAC TATCGTCTTG 1140

AGTCCAACCC GGAAAGACAT GCAAAAGCAC CACTGGCAGC AGCCACTGGT AATTGATTTA 1200

GAGGAGTTAG TCTTGAAGTC ATGCGCCGGT TAAGGCTAAA CTGAAAGGAC AAGTTTTGGT 1260 

GACTGCGCTC CTCCAAGCCA GTTACCTCGG TTCAAAGAGT TGGTAGCTCA GAGAACCTTC 1320

GAAAAACCGC CCTGCAAGGC GGTTTTTTCG TTTTCAGAGC AAGAGATTAC GCGCAGACCA 1380

AAACGATCTC AAGAAGATCA TCTTATTAAT CAGATAAAAT ATTTCTAGAT TTCAGTGCAA 1440

TTTATCTCTT CAAATGTAGC ACCTGAAGTC AGCCCCATAC GATATAAGTT GTAATTCTCA 1500 

TGTTTGACAG CTTATCATCG GATGGATCTG AAATTGTAAA CGTTAATATT TTGTTAAATT 1560
CGCGTTAAAT TTTTGTTAAA TCAGCTCATT TTTTAACCAA TAGGCCGAAA TCGGCAAAAT 1620
CCCTTATAAA TCAAAAGAAT AGACCGAGAT AGGGTTGAGT GTTGTTCCAG TTTGGAACAA 1680 
GAGTCCACTA TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG TCTATCAGGG 1740 
CGATGGCCCA CTACGTGAAC CATCACCCTA ATCAAGTTTT TTGGGGTCGA GGTGCCGTAA 1800 
AGCACTAAAT CGGAACCCTA AAGGGAGCCC CCGATTTAGA GCTTGACGGG GAAAGCCGGC 1860 
GAACGTGGCG AGAAAGGAAG GGAAGAAAGC GAAAGGAGCG GGCGCTAGGG CGCTGGCAAG 1920 
TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC CGCTACAGGG 1980 
CGCGTCAGAT CCCATCGATA AGCTTTAATG CGGTAGTTTA TCACAGTTAA ATTGCTAACG 2040 
CAGTCAGGCA CCGTGTATGA AATCTAACAA TGCGCTCATC GTCATCCTCG GCACCGTCAC 2100 
CCTGGATGCT GTAGGCATAG GCTTGGTTAT GCCGGTACTG CCGGGCCTCT TGCGGGATAT 2160
CGTCCATTCC GACAGCATCG CCAGTCACTA TGGCGTGCTG CTAGCGCTAT ATGCGTTGAT 2220 
GCAATTTCTA TGCGCACCCG TTCTCGGACC ACTGTCCGAC CGCTTTGGCC GCCGCCCAGT 2280
CCTGCTCGCT TCGCTACTTG GAGCCACTAT CGACTACGCG ATCATGGCGA CCACACCCGT 2340
CCTGTGGATC CGCTGGCGAA AGGGGGATGT GCTGCAAGGC GATTAAGTTG GGTAACGCCA 2400 
GGGTTTTCCC AGTCACGACG TTGTAAAACG ACGGCCAGTG AATTGCGGCC GCCCTGCAAG 2460 
GAAGGGAATG TCGCCAACAG CGAAGAGAGT TGGGCAACGG ATGTGCTGGT GGAGGTGATC 2520
GCCTCCTGAT GATGAGCCGC TCCCGATGTG GTGTCGGGAG CGGTATTTTC TATAAAACTT 2580
ACCGCTTATT TGAGATATTC ATCGAAAATG TCGAGTAATT CTTGATGTAT ACACGGCCAT 2640 
TCCTGACCTA AATTGACGGT ACACAAGCCA ATATCGAAGC CATTAATTTT ATAACGATGT 2700
TTCACTGCGG TATCTACGTG GGGATATATT AATAACCCCC CTATGTTTTC GCCATTTTCA 2760 
GGCTTTAACG ACCATAAGTA ATTCATCAGT TGATAAAGAT TTTGCGAATG AAATTTTTCT 2820 
GTTCCCATTC GTCGTGAAAA AATGCTCTTA TAGTATTTGG CGTCAACGAT AAGTATTTTT 2880 
TCTGATGAGC GAATGGTGAT GTCAGTTTCC ATTCGAGGTA ACAAATTAAG TGACTGATCC 2940
GATATACTCG ATGCATCCCA TTTTAAATAA GAGCGGGTTG TGTTTGCAGA CGTTAATTCA 3000
CGACGGCAAA ATTCATAAAG AAACTTTTGA TAAAGTAATG ACATCTCTTT TTCGTTTCTT 3060
TCAAAATCAT AGAAACGGTA GTGTCCTTTG TTTTGACCTG GAATAGAATT ATTGACGATG 3120
AATTTGCAGA CACTGATAAC GAATTTATAA TAACGCGTAT TTTTTCCGCC ATTCAGATAG 3180 
CTGAAATGCT GCGGAGTTAA ATGAAGAGTG CTAATGCCCG GTAATTTTCT ATAAAGTGAA 3240 
CGAGCTTCAT CTCTGATAGT TGAATTTAAC TTTTCATGCT TAATTAATAT GGCTAATGTG 3300
CTTTTTATAA TTCGGTTAGC CAGCGTGTCT TCATTAAGCA TATCAAAAGT ACTGACGGTT 3360
TTCCCATGAT TAAGATGGAA GCCGCGTATT GTTTTAGCAA ACTCTATTCG CCCTTTGATG 3420
CCAGGAATGA TCTCGGTGTT AGGATTGTAA TCAAGCTCAA GCCCTCGGCG TGAAAGCTGT 3480
AAAACCCCTT TATTTAATAC ATACCCCAGG ATATCAAGAA GATTGTTACC GGGTATGGCT 3540
TCAAGGTTTG CCTGCTTAAT TTCCTGTAAA TAACCCCATG CATAGGTAAG CATGTAATAG 3600
ATATTACGGA CAGGTATCAC GGGCTGTTCC ACTATGAGTC CCCTAATAAT TTGTTGGTCC 3660
ATTTCTGTTG TTTATAGGGG TCATCAAAGA AATATTCTTC GAGTAAAGGG GCGATATCCG 3720
TCATCACAAT TTCATTAAGC CATTGTGTAT CCGGAGAGGT GCCATCTTCC AACCCACAGC 3780
AGAAGTAACT ATGCCCAATG CGGAATCCTT TCCCAAGGAT AGTGGCCTCT TTGCTGATTT 3840
CCTGGTTCAA CTCGTTCATT TTTTGGCATA AAGACTCAAC AAATGAAGGT TCTGCTTTTT 3900
TATTCAGTAA AAAATTCCGG AACTGTGGTG TATCAAAACC TGGCTCAATA TCTATGAAAG 3960
AAAATCGTCT GCGTAGGGCA TAGTCAACAA CGGCCAGAGA GCGATCGGCA GTATTCATTA 4020
AACCGATGAT ATAAACATTC TCCGGGACAT AGAATCGTTC TTCATCGTTT TCGGAGTAGG 4080
TTAGGGGAAC AGACCAGTTT TCACCTCGTT TATCATGTTC CATTAACATC ATCACTTCGC 4140
CAAATACTTT ACTGAGATTG GCACGATTGA TTTCATCTAT AATAAAAATA TACTTTTTCT 4200
CTGGCTGCTC TTTAGCTTGC TGACAAAAAT TGTAAAATAT GCCGTCTTTA CGTCGGAAGC 4260
CGACGCCATT CGGACGATAG C^'^TATAA AATCCTCATA GCTATAAGAT TGATGGAACT 4320
GAACCATATT GACGCGTTGC GGAGCCTTTT CTCCTGTCAG CAAGTAAGCC AGACGGCGTG 4380
CAACAAAGGT TTTTCCAACG CCGGGCGGCC CCTGGAGGAT AATATTTTTT TTGATGGTTA 4440
ATCGTTTGAG TATCGTCTCT ATTGTGGTTT CAGGGATAAA CAAATCATTT AACGCATCTT 4500
CCAGACAGTA TGATTCAGTT TTTGACATAG GTGGAATAAC ACTCTTGCCA GAATTAAATA 4560
TTAATTTATA GTCGTTGATT ATGTTGTCCA GCATAGAGGC AAATCGGGTG TAATCAATAC 4620
CCTGTGAGAC TTTTTGGGAA CAGGCGTAAT AGGACTGTCC GTATTTTTTA GGATATACAC 4680
CCGAAGTTGC CTGAAAATAC TCTGCGATTG TTTTAGGTAT GTCTGAAGAG AACTGCCATT 4740
GGGCATGTGG TTCATTCGTG TCGCTTATAC CATAAGCCAA AACCAACTCA TCAAAATCTT 4800
TATAATAGAG AATAACGGGA TATATACCGT TAGAAGCTTC CTGACCTTCT CCAAGAAATG 4860
CAAACCAGGG AATAGACGTA AAATTACCAT AACCGAAACT CAATTTTACT CGCAGGTTAC 4920
GGTAAGACGT TGGATAATCT TTAGTGGATT GCGAACGTTG TTGCTGTGCT TGCTTAATAA 4980
ATTTTTCAAT CCAGGGTTGA ATAGATTCCA TAAGATATGC CTTCCTCATT GCTAAGCCTC 5040
TATTATCGCT TTCGCAACGT ACTGAAACAA TAGATTTTTA CTGCAAAATC AGACTGGTAA 5100
ATATTTACTG AGGGGGAAAG TTTCTATTGA GTCAGTGGAA GGCTCCCGGT GGTTAACCGG 5160
GAGTAAACGC TGTTACGCGA CTTTCTGTTT ACCGGCAATC ACTCCAATAA ACGCCTGCAC 5220
CTGCTTTTGT TTACGCGCCG ACAGTTTGCA CACCTGGCGT AGCGACTGCA TCAGTTCGCT 5280
CTCCTCGGCG GCGGGTGGTT GGGCGGTGAG GACAATACAG CCTTCCATCA CTTTGACATC 5340
TACCGCCGTG CCAGTGGCAA AACCGGCGGC TTCCAGCCAC TGACCTTTCA GGGTGATGGC 5400
GGGAATACGG CTGTAATCCG GGTAGCGACT CGCATAACCG ACGGTGACAT GACGGTTATT 5460
TGCCGGGGAG ACTTCTGCTT CGAACGGTTG TGCAATAGAA TGCGTGTCAG TCATAACTGC 5520
TATTCTCCAG GAATAGTGAT TGTGATTAGC GATGCGGGTG TGTTGGCGCA CATCCGCACC 5580
GCGCTAAATA CCTGTATATA TCATCAGTAA ATATGGGGAA AGTCCAGCTA AAAATAGAAT 5640
AAAATGGGCA ATTTCTGGAA TGATTTAAAT ATATTTATGT GGGTTATGAT TGGCGTGAAA 5700

TAATAAAAAG CGCACCCCAA AGGTGCGCCA G AAAAT AAT G 1 1C AUGATTT TTTACGTGAG 576 0 

GCTTTTTTAC CCCCGCTAGC TGCGCGTTCA GCTTTGATTT TTTCCAGCAA CGCGGCGGCG 582 0 

CTGTTTTCTC CGCTGATCAA ATCCGGGTTT TCGGCCCGCC ACTGGGCGGT AAGTTCACCA 58 8 0 

CGGAACGCTT TTGCCAGGAT GGATTGCGTC AGGTTGTTGA CGCGGGCTAA GGCGTTGTTG 5 94 0 

ACCTGTTTTT CTATGGTGTC GGCGTAGGCG AAGAGTTGCT CGACGCGGCG AACGATTTCG 6000 

GCTTGTTCTT TTACTGGAGG TAATAAAACA ACTTGGGATT TGATATCTTT TCCTGAAATA 6 06 0 

CCTTTTTGAC CAGAAGTTGT TTTCACGCAG TTCATCATTG CATTTCGTGC TGAGGGGGAT 6120 

GAAAAAAATA TTTCGATATA TTCTGGTAAA GCATCTTTGG TTAATCGAGC TCGAATAAGT 6180 

TTATCAGGAT ATAGCAAATT TTGATGTTGT AATTTTTTCA ATAACCCACA AACACCAACA 624 0 

AATTCTAAAC TTCCGTTATA GCGAGTAAAT AAAAG AT CT C CATCTTGTAA TTTGTGGCGG 6300 

TTTAGTT C AC TTTCTGAACA TTCTAGAGTC GACCTGCAGG CATGCAAGCT TGGCGTAATC 6 36 0 

ATGGTCATAG CTGTTTCCTG TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG 64 2 0 

AGCCGGAAGC ATAAAGTGTA AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT 648 0 

TGCGTTGCGC TCACTGCCCG CTTTCCAGTC GGGAAACCTG TCGTGCCAGC GGATCCTCTA 6 54 0 

CGCCGGACGC ATCGTGGCCG GCATCACCGG CGCCACAGGT GCGGTTGCTG GCGCCTATAT 6600 

CGCCGACATC ACCGATGGGG AAGATCGGGC TCGCCACTTC GGGCTCATGA GCGCTTGTTT 666 0 

CGGCGTGGGT ATGGTGGCAG GCCCCGTGGC CGGGGGACTG TTGGGCGCCA TCTCCTTGCA 672 0 

TGCACCATTC CTTGCGGCGG CGGTGCTCAA CGGCCTCAAC CTACTACTGG GCTGCTTCCT 6780 
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AATGCAGGAG 


TCGCATAAGG 


GAGAGCGTCG 


ACCGATGCCC 


TTGAGAGCCT 


TCAACCCAGT 


6840 


CAGCTCCTTC 


CGGTGGGCGC 


GGGGCATGAC 


TATCGTCGCC 


GCACTTATGA 


CTGTCTTCTT 


6900 


TATCATGCAA 


CTCGTAGGAC 


AGGTGCCGGC 


AGCGCTCTGG 


GTCATTTTCG 


GCGAGGACCG 


6960 


CTTTCGCTGG 


AGCGCGACGA 


TGATCGGCCT 


GTCGCTTGCG 


GTATTCGGAA 


TCTTGCACGC 


7 02 0 


CCTCGCTCAA 


GCCTTCGTCA 


CTGGTCCCGC 


CACCAAACGT 


TTCGGCGAGA 


AGCAGGCCAT 


708 0 


TATCGCCGGC 


ATGGCGGCCG 


ACGCGCTGGG 


CTACGTCTTG 


CTGGCGTTCG 


CGACGCGAGG 


714 0 


CTGGATGGCC 


TTCCCCATTA 


TGATTCTTCT 


CGCTTCCGGC 


GGCATCGGGA 


TGCCCGCGTT 


7 2 0 0 


GCAGGCCATG 


CTGTCCAGGC 


AGGTAGATGA 


CGACCATCAG 


GGACAGCTTC 


AAGGATCGCT 


7 2 6 0 


CGCGGCTCTT 


ACCAGCCTAA 


CTTCGATCAT 


TGGACCGCTG 


ATCGTCACGG 


CGATTTATGC 


73 2 0 


CGCCTCGGCG 


AGCACATGGA 


ACGGGTTGGC 


ATGGATTGTA 


GGCGCCGCCC 


TATACCTTGT 


7 3 8 0 


CTGCCTCCCC 


GCGTTGCGTC 


GCGGTGCATG 


GAGCCGGGCC 


ACCTCGACCT 


GAATGGAAGC 


7440 


CGGCGGCACC 


TCGCTAACGG 


ATTCACCACT 


CCAAGAATTG 


GAGCCAATCA 


ATTCTTGCGG 


7 5 0 0 


AGAACTGTGA 


ATGCGCAAAC 


CAACCCTTGG 


CAGAACATAT 


CCATCGCGTC 


CGCCATCTCC 


756 0 


AGCAGCCGCA 


CGCGGCGCAT 


CTCGGGCAGC 


GTTGGGTCCT 


GGCCACGGGT 


GCGCATGATC 


7 6 2 0 


GTGCTCCTGT 


CGTTGAGGAC 


CCGGCTAGGC 


TGGCGGGGTT 


GCCTTACTGG 


TTAGCAGAAT 


7 6 8 0 


GAATCACCGA 


TACGCGAGCG 


AACGTGAAGC 


GACTG CTGCT 


GCAAAACGTC 


TGCGACCTGA 


7 74 0 


GCAACAACAT 


GAATGGTCTT 


CGGTTTCCGT 


f~i t» T> r-r» /-i . ~i m r. > n 
O X X X V_' j X /Wi 


GTCTGGAAAC 


GCGGAAGTCC 


7 6 0 0 


CCTACGTGCT 


GCTGAAGTTG 


CCCGCAACAG 


AGAGTGGAAC 


CAACCGGTGA 


TACCACGATA 


7 8 6 0 


CTATGACTGA 


GAGTCAACGC 


CATGAGCGGC 


CTCATTTCTT 


ATTCTGAGTT 


ACAACAGTCC 


7 9 2 0 


GCACCGCTGC 


CGGTAGCTCC 


TTCCGGTGGG 


CGCGGGGCAT 


GACTATCGTC 


GCCGCACTTA 


7 980 


TGACTGTCTT 


CTTTATCATG 


CAACTCGTAG 


GACAGGTGCC 


GGCAGCGCCC 


AACAGTCCCC 


8 04 0 


CGGCCACGGG 


GCCTGCCACC 


ATACCCACGC 


CGAAACAAGC 


GCCCTGCACC 


ATTATGTTCC 


8100 


GGATCTGCAT 


CGCAGGATGC 


TGCTGGCTAC 


CCTGTGGAAC 


ACCTACATCT 


GTATTAACGA 


816 0 


AGCGCTAACC 


GTTTTTATCA 


GGCTCTGGGA 


GGCAGAATAA 


ATGATCATAT 


CGTGAATTAT 


8 2 2 0 


TACCTCCACG 


GGGAGAGCCT 


GAGCAAACTG 


GCCTCAGGCA 


TTTGAGAAGC 


ACACGGTCAC 


8280 


ACTGCTTCCG 


GTAGTCAATA 


AACCGGTAAA 


CCAGCAATAG 


ACATAAGCGG 


CTATTTAACG 


8340 


ACCCTGCCCT 


GAACCGACGA 


CCGGGTCGAA 


TTTGCTTTCG 


AATTTCTGCC 


ATTCATCCGC 


8400 


TTATTATCAC 


TTATTCAGGC 


GTAGCAACCA 


GGCGTTTAAG 


GGCACCAATA 


ACTGCCTTAA 


8460 


AAAAATTACG 


CCCCGCCCTG 


CCACTCATCG 


CAGTACTGTT 


GTAATTCATT 


AAGCATTCTG 


8520 
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CCGACATGGA AGCCATCACA GACGGCATGA TGAACCTGAA TCGCCAGCGG CATCAGCACC 8 58 0 

TTGTCGCCTT GCGTATAATA TTTGCCCATG GTGAAAACGG GGGCGAAGAA GTTGTCCATA 8640 

TTGGCCACGT TTAAATCAAA ACTGGTGAAA CTCACCCAGG GATTGGCTGA GACGAAAAAC 8700

ATATTCTCAA TAAACCCTTT AGGGAAATAG GCCAGGTTTT CACCGTAACA CGCCACATCT 8760

TGCGAATATA TGTGTAGAAA CTGCCGGAAA TCGTCGTGGT ATTCACTCCA GAGCGATGAA 8820

AACGTTTCAG TTTGCTCATG GAAAACGGTG TAACAAGGGT GAACACTATC CCATATCACC 8880

AGCTCACCGT CTTTCATTGC CATACG 8906
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3190 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "pRM2 (target plasmid)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 60

CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGO"AAC GCAATTAATG TGAGTTAGCT 120

CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT TGTGTGGAAT 180

TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC CATGATTACG AATTCGAGCT 240

CGGTACCCGG GGATCCTCTA GAGTCGAGAT GCCGCATGTG GAAGAGGTGA TTGCACCGAT 300

CTTCTACACC GTTCCGCTGC AGCTGCTGGC TTACCATGTC GCGCTGATCA AAGGCACCGA 360

CGTTGACCAG CCGCGTAACC TGGCAAAATC GGTTACGGTT GAGTAATAAA TGGATGCCCT 420

GCGTAAGCGG GGCATTTTTC TTCCTGTTAT GTTTTTAATC AAACATCCTG CCAACTCCAT 480

GTGACAAACC GTCATCTTCG GCTACTTTTT CTCTGTCACA GAATGAAAAT TTTCTGTCAT 540

CTCTTCGTTA TTAATGTTTG TAATTGACTG AATATCAACG CTTATTTAAA TCAGACTGAA 600

GACTTATCTC TCTCTGTCAT AAAACTGTCA TATTCCTTAC ATATAACTGT CACCTGTTTG 660

TCCTATTTTG CTTGTCGTAG CCAACAAACA ATGCTTTATG AATCCTCCCA GGAGACATTA 720

TGAAAGTTAT GCGTACCACC GTCGCAACTG TTGTCGCCGC GACCTTATCG ACCTGCAGGC 780

ATGCAAGCTT GGCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA 840

CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG CTGGCGTAAT AGCGAAGAGG 900

CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA TGGCGAATGG CGCCTGATGC 960

GGTATTTTCT CCTTACGCAT CTGTGCGGTA TTTCACACCG CATATGGTGC ACTCTCAGTA 1020

CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGCCCCGACA CCCGCCAACA CCCGCTGACG 1080

CGCCCTGACG GGCTTGTCTG CTCCCGGCAT CCGCTTACAG ACAAGCTGTG ACCGTCTCCG 1140

GGAGCTGCAT GTGTCAGAGG TTTTCACCGT CATCACCGAA ACGCGCGAGA CGAAAGGGCC 1200

TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG 1260

GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT 1320

CAAATATGTA TCCGCTCATG AGACAAT AA 1 Z CCTGATAAAT GCTTCAATAA TATTGAAAAA 1380

GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT 1440

GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT 1500

[illegible] AGTGGGTTAC ATCGAAu'i ATCTCAACAG CGGTAAGATC CTTGAGAGTT 1560

TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG 1620

TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA 1680

ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA 1740

GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA 1800

CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGlA CAArATGGGG GATCATGTAA 1860

CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA 1920

CCACGATGCC TGTAGCAATG GCAAC/iACGT TGCGCAAACT ATTAACTGGC GAACTACTTA 1980

CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC 2040

TTGTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC 2100

GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG 2160

TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA 2220

TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT 2280

AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA 2340

ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG 2400

AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA 2460

CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT 2520

TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC 2580

CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA 2640

TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA 2700

GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC 2760

CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA 2820

GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA 2880

CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG 2940

GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC 3000

TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG 3060

CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG 3120

AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG 3180

AAGCGGAAGA 3190
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION : /desc = "Oligonucleotide (NLC95) 
used to analyze products of transposition." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
ATAATCCTTA AAAACTCCAT TTCCACCCCT 30 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide (NLC209) 
used to analyze products of transposition" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GTGATTGCAC CGATCTTCTA CACCGTTCC 29 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide 
used to analyze products of transposition" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TTTCACCGTC ATCACCGAAA CGCGCGAGAC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION : /desc = "Oligonucleotide 
used to analyze products of transposition" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 10 : 
AATGACTTGG TTGAGTACTC ACCAGTCACA 
(2) INFORMATION FOR SEQ ID NO : 11 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide 
used to analyze the products of transposition" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 : 
ATGAACGAAA TAGACAGATC GCTGAGATAG 



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide 
used to analyze products of transposition" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CAAGACGATA GTTACCGGAT AAGGCGCAGC 
(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide 
used for sequence determination" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 
AAAGTCCAGT ATGCTTTTTC ACAGCATAAC 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Asn Tyr Asn Arg Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO : 15 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 15 : 

Asn Tyr Thr Arg Asn 
1 5 